Mon. Not. R. Astron. Soc. 000, 000-000 (0000) Printed 30 November 2010 (MN MfcjX style file v2.2) 



Star formation trends in high-redshift galaxy surveys: 
the elephant or the tail? 



Martin Stringer 1 , Shaun Cole 1 , Carlos S. Frenk 1 & Daniel P. Stark 2 

1. Institute of Computational Cosmology, Department of Physics, University of Durham, South Road, Durham DH1 3LE 

2. Kavli Institute of Cosmology & Institute of Astronomy, University of Cambridge, Madingley Road, Cambridge CB3 0HA 



30 November 2010 



ABSTRACT 

Star formation rate and accummulated stellar mass are two fundamental physical 
quantities that describe the evolutionary state of a forming galaxy. Two recent at- 
tempts to determine the relationship between these quantities, by interpreting a sam- 
ple of star-forming galaxies at redshift of z ~ 4, have led to opposite conclusions. We 
use a model galaxy population to investigate possible causes for this discrepancy and 
conclude that minor errors in the conversion from observables to physical quantities 
can lead to major misrepresentation when applied without awareness of sample se- 
lection. We also investigate, in a general way, the physical origin of the correlation 
between star formation rate and stellar mass within hierarchical galaxy formation 
theory. 



1 INTRODUCTION 



As more distant galaxy populations become accessible to 
modern surveys, astronomers are striving to estimate their 
physical properties, despite the challenges inherent in such 
pioneering tasks. Light which barely registers on our instru- 
ments is analysed to infer the stellar mass and star forma- 
tion activity of its source, providing valuable stepping stones 
on which our physical picture of structure formation can 
progress. 

For example, Stark et al. (2009) produced estimates of 
stellar mass for 1038 galaxies from the the GOODS survey, 
grouped into three populations by redshift: z ~ 4, 5 and 6. 
These stellar masses were estimated using a population syn- 
thesis model (Bruzal & Chariot 2003; Bruzal 2007) which 
searches for the stellar population which best fits the ob- 



served spectral energy distribution of each galaxy (see [ 3.4 1 



Star formation rates were specifically not derived for 
this sample, because of uncertainties in the extinction cor- 
rection. In lieu of this, the galaxies' "emerging" UV lumi- 
nosities were computed (the luminosity at 1550A without 
any dust correction). However, figure 9 of Stark et al. (2009) 
does include the star formation rates that would be inferred 
if a standard proportionality between UV luminosity and 
star formation rate were assumed (Madau, Pozzetti & Dick- 
inson 1998, see Appendix[B|. This figure, for the galaxies in 
the nearest of the three samples, is reproduced for reference 
in the upper panel of our Fig[l] 

Despite only a fleeting appearance in the observational 
paper, these star formation rate estimates have since been 
the subject of quite detailed theoretical analysis. Dutton et 
al. (2010) summarise the trend given by the sample in Fig[l] 
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Figure 1. The estimated star formation rates and stellar masses 
for a population of galaxies observed at redshift, z f» 4 by Stark 
et al. (2009). The upper panel shows the star formation rate as 
the y-axis, and it is on these axis that the trend indicated by the 
dashed line jl| was found to fit the data by Dutton et al. (2010). 
The lower panel shows the specific star formation rate as the 
y-axis. In this plane, a quite different correlation is apparent, as 
noted by Khochfar & Silk (2010). The red dotted line shows the 
completeness limit in UV magnitude. 
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which implies that the specific star formation rate {Mi, /Mi,) 
is only weakly dependent on the stellar mass. Meanwhile, 
when studying the same sample of observational estimates, 
Khochfar & Silk (2010) set out: 

"To recover the strong observed mass-dependence of the 

specific star formation rate..." 

So the same sample has been interpreted, on the one 
hand, as having a strong correlation with stellar mass and, 
on the other hand, a weak correlatiorQ What is the reader 
to conclude from this literature? 

The confusion can be appreciated by comparing the two 
panels in Fig. [T] The trend does not seem unreasonable 
when looking at the top panel, but the problem is that the 
observational limit in UV magnitude creates a correspond- 
ing limit in star formation rate (dotted red line in Fig. [T]). 
Because of the greater abundance of fainter galaxies, the 
population piles-up against this limit. 

Consequently, when the sample is then plotted on the 
axes in the lower panel (in order to investigate the specific 
star formation rate), the dominant feature that translates to 
the new axes is a dense locus of galaxies around the obser- 
vational limit. As this forms a constant line on the y-axis in 
the upper panel, when divided by the x-axis (stellar mass) 
to create the lower panel it translates to " constant / x" , thus 
giving a the impression of "strong mass-dependence" . 

Meanwhile, at lower redshifts, there is similar confusion. 
For example, using far infrared luminosity as a tracer for star 
formation rate, Pannella et al. (2009) are led to conclude: 

"within the explored mass range, the SSFR of z ~ 2 star-forming 
galaxies is almost independent of stellar mass. " 

Conversely, the star formation rates measured by Rodighiero 
et al. (2010), also using the UV luminosity, suggest: 

"A negative trend of SSFR with mass is evident at all redshifts, 
although the scatter is quite large." 

This result is corroborated, amongst others, by Dunne et al. 
(2009), but with heavy caveats: 

"In summary, many independent studies find similar trends, 
with SSFR increasing with redshift and decreasing with stellar 
mass. However, the strength of these relationships varies con- 
siderably and is likely to be due to a complicated combination 
of sample-selection criteria, particularly wavelength and depth. 
Many authors make strong statements about down-sizing, based 
on the evolution of SSFR with redshift in different stellar mass 
bins, and on the oft-strong correlation of SSFR with stellar mass. 
We would urge caution before over-interpreting this type of plot 
as it is influenced strongly by selection biases." 



1 Both sets of authors agree on the relative evolution in spe- 
cific star formation rate implied by the data when compared with 
equivalent relationships at low redshifts, and that this evolution 
seems to cease (appear constant) for z ^ 4. Dutton et al. (2010) 
explain this in terms of high gas densities, and thus higher star 
formation rates, for a galaxy of a given mass at higher redshift. 
Khochfar & Silk (2010) look for modulated models of accretion- 
driven star formation. In this paper, we focus on the extent to 
which the data may or may not reveal the true underlying evolu- 
tion (Q. 



On this note, it is clear that further theoretical interpreta- 
tion of patterns in the M* — M* plane requires an under- 
standing of how the points arrive on these axes. 

In ij3] we confront the pitfalls in the process of trans- 
lation between observable and physical properties by look- 
ing at a plausible model galaxy sample, both in its entirety 
and through the restrictions of an observational survey. The 
model we use to generate this mock population is a version 
of GALFORM (see Appendix |Aj which is a development of 
the model applied in Bower et al. (2006). No claim is made, 
in this context, that this is the correct physical picture. All 
that matters for this exercise is that we are using a realistic 
model, based on current understanding of the physical pro- 
cesses involved, and that this particular observational sam- 
ple could have been drawn from the model population. This 
caveat allows us to focus on the investigation in hand: the 
difference between what would be inferred from a sample, 
and the true characteristics of the population from which 
was drawn. In Q this investigation is extended to look at 
the appearance of populations at a range of redshifts. 

Before carrying out these simple but instructive exer- 
cises, we momentarily set the issue aside to consider what 
general hierarchical galaxy formation theory brings to bear 
on the M t — M* relationship, in the absence of any particular 
model or agenda (^2|. Clearly this is required for a proper 
interpretation of the observational data: 

"Constraining the nature of the physical processes by which spe- 
cific star formation rates are kept approximately constant in star- 
forming galaxies of wildly different mass [presents] substantial 
challenges for theoretical models to reproduce." (Pannella et al. 
2009) 



2 SPECIFIC STAR FORMATION RATES 

The two quantities that were involved in Fig. [T] are stellar 
mass, M^, and star formation rate, M*. These are related 
by definition: 



M*(z) 



f(t z -t)Mi,(t)dt 



(2) 



where /(At) is the fraction of the initial stellar mass that 
has been retained by a stellar population after time, At, 
since formation. This is related to the recycled fraction, 
R, that usually appears in galaxy formation models. In 
the instantaneous recycling approximation, R is a constant: 
R=l-f*'f(t)dt. 

Of course, when referring to |2|, one must not forget 
the hierarchical assembly of the final system. Some stars 
that are present in a galaxy at time t z would have formed 
in a separate, smaller system at earlier times. So a more 
explicit version of ([2| would be: 



M*{z) 



/"£/(*. 

Jo 



t)Ma,(t)dt 



where the index, i, runs over all the progenitor galaxies 
which merge into the final host. From hereon in, this bulky 
notation is assumed rather than repeatedly stated. 

To provide some preliminary insight into the likely final 
relationship that emerges between M* and M*, equation |2| 
can be rewritten: 
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Figure 2. The connection between star formation histories and common specific star formation rates. The peripheral panels show the 
star formation rate vs. time for five galaxies, plotted in units of the value at the final time t = t4 (in this case at redshift 4). The main 
panel then shows the location of these five galaxies on a plot of final specific star formation rates and stellar mass, in the context of the 
population as a whole. The area under the curve in the peripheral panel dictates the vertical location of that galaxy in the main panel. 
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The integral which remains in (J3J) is just the integral of the 
star formation history in units of its current value. This is a 
quantity which could potentially be largely system indepen- 
dent, if the hierarchical assembly of a halo can be approxi- 
mated as self-similar. This would be expected over a certain 
mass range in a cosmology with conditional mass function 
index close to unity (Parkinson, Cole & Helly 2008). 

For the galaxies within the halos, such underlying sim- 
ilarity can be broken by various factors. At low mass mass, 
it is broken by cooling thresholds and, at high mass, by 
long cooling times and processes such as active galactic nu- 
clei. However, at intermediate masses, the underlying self- 
similarity in the halo assembly should carry through quite 
well to the galaxy population. In other words, hierarchical 
formation theory leads us to suspect that: 



/(*. 



x MM) , 
t) . y ' At i 
MM Z ) 



(1-R)t, 



(4) 



where r z is some timescale that is common to all galaxies 
at that redshift]^] This would lead to a direct proportional- 
ity between stellar mass and star formation rate at a given 
redshift: 



M-,, 



(1 - R) r z 



(5) 



To explore this supposition, we look at a model galaxy 
population generated using the GALFORM model. The un- 
derlying population of halos (with total host masses above 
10 9 /i -1 Mq at 2 = 0) is generated in a comoving volume of 
10 s Mpc, assuming standard cosmological parameters (Ko- 
matsu et al. 2010). These halos are then populated with 
galaxies using parameters and assumptions detailed in Ap- 
pendix [A] The evolution of subhalos is followed by the 
model, right down to a subhalo mass of 1O 7 /i _1 M , but we 
do not include satellite galaxies in this part of the discussion 
as they represent a distinct population governed by their 
own particular evolutionary characteristics. The main panel 
of Fig. [2] shows all the central galaxies in this population in 
the M-t, — M*/M* plane, as they are at redshift, z = 4. 

Now, equation |5| says that the ratio of the current star 
formation rate to the mean star formation rate in a galaxy 
rarely differs significantly from one system to another. It 
does not say that the systems have a constant star formation 
rate. Far from it. This important point is illustrated in the 
peripheral panels of Fig. [2] which show the star formation 
rate history of five galaxies in the model population, plotted 
as a fraction of their final value. What brings three of them 
onto the main trend is not that they have constant or similar 
star formation histories, but that their varied and sporadic 
histories are subject to hierarchical assembly in the same 
cosmology. Put another way, if the star formation rate had 
persistently differed from its current value, the system would 
have just ended up with a different stellar mass (and be in 
a different position but on the same trend). 



The lines of relative star formation rate must all con- 
verge on unity at t = to. Though all deviate significantly 
from this over the course of their history, as mergers and 
instabilities create bursts and lulls of star formation, the in- 
tegral under this line invariably ends up being about the 
same value (unless you happen to catch a system at the 
height or tail of one of these episodes, as in the left corner 
panels of Pig. [2J . This integral is just our timescale from Q, 



2.1 Insensitivity to star formation processes 

Equation (JsJ describes the trend which connects the star 
formation rate of a galaxy at redshift, z, to the host stellar 
mass. It is motivated solely by the argument that the star 
formation histories of all centra^] galaxies are subject to the 
same principal constraints; the age of the universe and nat- 
ural statistical fluctuations due to mergers and instabilities. 

This argument is not an attempt to evade the impor- 
tant complexities of star formation. Rather, we wish to be 
realistic about what the observed M» — M* trend can teach 
us. The fact that details were not needed to support the 
argument suggests that documenting the main trend at any 
particular redshift, may not help us distinguish between dif- 
ferent proposed theories of star formation and feedback. 

Over a range of redshifts, the evolving position of the 
trend (i.e. the value of t z ) may provide more clues (see fj4|, 
but only given an assumed halo merger history. The informa- 
tion that is really being provided by such surveys concerns 
the typical mass assembly history. Part of this story is in- 
deed the star formation process, but it is very difficult to 
separate this from the dominant influence of the structure 
formation process which is, as we have argued, ultimately 
responsible for pi). 

The characteristic which is sensitive to star formation 
and feedback physics, even over a small redshift range, is 
the position of the population along the trend. If star for- 
mation and/or feedback had been different, each halo would 
have ended up hosting a different stellar mass. The galaxies 
would still have every reason to appear on our trend, but to 
the left or right of their original position, not perpendicular 
to it. This has been previously noted (Stringer 2008; Dutton 
et al. 2010). So, given a known or assumed halo population, 
it is the luminosity function that can be used to constrain 
theories of star formation and feedback, whilst direct mea- 
surements of star formation rates, oddly, may not. 

Having made these few simple points about theoretical 
expectations, we now move to the more pertinent matter 
of how the reality of these physical properties might be re- 
vealed. The model from Fig. [2] can be approached as if it 
were real data and the process of observation followed to 
find out if and where pitfalls in the interpretation of the 
data may lie. 



2 For example, if all galaxies formed at t = Q and each had 
(its own particular) constant star formation rate, this common 
timescale would just be the age of the universe: t z = t z . 



3 The positions of satellites in the M* — M t plane are more sensi- 
tive to the details of star formation physics, and are mostly found 
off the main trend (Lagos et al., in prep.), hence our restriction 
of the above argument to central galaxies. 
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Figure 3. The apparent magnitudes of the real sample (dots) 
and model sample (crosses). The x-axis is the rest-frame visible 
and the y-axis is the rest-frame UV magnitude. The range covered 
by these axes is highlighted in Fig. [1] which shows the absolute 
magnitude of the same model galaxies, but includes the entire 
population in addition to this magnitude-limited sample. 



3 TRUE AND INFERRED POPULATIONS 

To illustrate the importance of sampling effects in high red- 
shift surveys, the generated population of galaxies from S|2] 
can be analysed under the same observational constraints, 
and using the same techniques as were applied by Stark et 
al. (2009) to the real data. 

Fig.[3]shows the real and model samples on a plot of ob- 
served quantities: rest frame visible magnitude vs. rest- frame 
UV magnitude]^] The two sets of points are somewhat offset 
from each other, but the statistical significance of this differ- 
ence is low; the majority of the population (i.e. the fainter 
galaxies) are overlapping. So, for the purposes of this purely 
illustrative exercise, we consider this model to be an accept- 
able match to the data. (For a discussion of discrepancies 
that exist between current semi-analytic models and recent 
observations, the reader is referred to Lacey et al. 2010). 

With this caveat, we proceed to follow our model sample 
all the way through from the "real" physical parameters 
to the magnitudes that would be observed, and then back 
again to the inferred physical parameters. This process from 
physical quantities to observables, and back, is shown as 
a sequence of panels in Fig. [4] Each transition (clockwise) 
from one panel to the next introduces one part of this chain, 
as follows: 



4 Throughout this paper, we use the simple magnitude notation 
M x (Absolute Magnitude rcst _ framo wave i eng th) and 
m x (apparent magnitude obscrvcd waV clcngth)- The band filters 
used to calculate the magnitudes in the figures were matched 
precisely to those in the observational survey. 



3.1 Dust 

The top left panel of Fig. [4] shows the star formation rates 
and stellar masses of the model galaxy sample. Immediately 
to the right of this is shown the mapping to absolute rest- 
frame UV magnitude. Whilst the scatter in the relation from 
SFR to initial UV emission is worth understanding (jp3|, it 
is very minimal. The real problem in any efforts to derive 
the SFR is the effect of intervening dust on the UV emission. 

An estimate of this effect is included in the model, af- 
ter Ferrara et al. (1999), by following the radiative transfer 
of light (at all wavelengths) through dust assumed to be 
distributed smoothly in the galactic disk. Metallicities are 
included in the calculation, and inclination angles are as- 
signed to each galaxy at random. For full details, the reader 
is directed to Cole et al. (2000) and Lacey et al. (2010). 

The top right panel shows the correlation between the 
absolute UV magnitude and star formation rate. For com- 



parison, a dashed line ({ 3.3 1 shows the relationship that will 



be assumed when mapping back from the UV to the SFR. 
Unsurprisingly, the effect of dust has been both to intro- 
duce scatter and to reduce the UV luminosities with respect 
to this estimate. The systematic effect of continuing to use 
this relationship (dashed line) can be appreciated from the 
remaining panels. 



3.2 Observational limits 

The middle right panel of Fig. [4] shows our model galaxy 
population in terms of two estimated observables, the UV 
and visible absolute magnitudes. This is the poin^at which 
we can turn the process around and analyse the sample to 
see how well we can recover the physical properties of the 
population. 

The choice of observational limits used for this illus- 
trative exercise are taken from Stark et al. (2009), namely 
that galaxies are included in the survey if their apparent 
magnitudes in the rest-frame UV satisfies 771775 < 27. Those 
sources that are fainter than 7713600 ~ 27 are not detected in 
that filter, but will still be included in the sample and their 
stellar masses computed using the measured 2-sigma upper 
limits. 



3.3 Inferred star formation rate 

The bottom right panel of Fig. [4] begins the mapping back 
to physical parameters. We apply the direct proportionality, 
used to produce Fig 9 in Stark et al. (2009), between abso- 
lute UV magnitude M\), and inferred star formation rate, 
M*: 



log 



Moyr- 1 



Miso + 18.45 
2.5 



(6) 



This simple conversion is well founded by our knowledge of 
stellar evolution, as explained by the proponents: 

"The UV continuum emission from a galaxy with significant 



Since both the galaxy formation model and the post- 
obervational analysis assume the same cosmology and model for 
IGM absorption, we neglect these parts of the process here. 
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Figure 4. An illustration of how a high redshift galaxy population could be misrepresented by a magnitude-limited sample. The top 
left panel shows the physical properties (total stellar mass, M*, and current star formation rate, M*) of a model galaxy population 
within a volume of 10 5 Mpc 3 h -3 at redshift, z = 4. The panel immediately to the right shows the translation to absolute magnitude, 
M\. The central panel shows the same rest-frame UV magnitudes together with their optical counterparts, Highlighted in red are the 
systems that would be seen in a survey such as that by Stark et al. (2009), with approximate magnitude limits as indicated by the dotted 
lines. The bottom panels show the assumed mapping from rest-frame UV magnitude back to star formation rate and then to stellar 
mass. Diagonal dashed lines indicate the position of equation |6]l and solid diagonal lines highlight the approximate trend in the top-left 
panel j5|. 
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ongoing star formation is entirely dominated by late-O/early-B 
stars on the main sequence" (Madau, Pozzetti & Dickinson 1998). 

The need to worry about the criterion of "significant 
ongoing star formation" is explored in S|B] The principal 
conclusion of this section is that this concern is very minor, 
particularly at the visible end of the luminosity range. 

Much more of a problem is the scatter due to dust, 
which was discussed in £3.1 particularly when |6j| is used 
to try to recover the star formation rates in spite of this 
scatter. Because of the increased number of galaxies towards 
the limits (fainter galaxies are more abundant), this working 
assumption has quite drastic effects on the appearance of 
the population when the final step is made back to inferred 
physical properties. 

Because of the relative abundance of fainter systems, 
there is an accumulation of points at the intersection be- 
tween the UV limit and the assumed SFR-UV relation (the 
two lines in the bottom right panel of Fig. [2|. When the 
final step is taken to map back to the original axes (the 
lower left panel), this concentration of data points spreads 
out across a range of values for stellar mass and creates the 
impression of constant star formation rate that we saw in 
Fig.[l] Crucially, the proportionality from the top left panel 
(reproduced again as a line in the bottom left) has been 
completely lost. 

Additionally, because the inferred SFR are lower than 
the original values, not only has the M* — M 4 correlation 
been lost, but the overall position of points in the plane has 
shifted. This systematic underestimation of star formation 
rates will be particularly important when trying to map the 
evolution of the relationship with redshift, as we show in ^ 



3.4 Inferred stellar mass 

To find the stellar mass that would be inferred for the model 
galaxies, the estimated apparent magnitudes from the mock 
sample were processed using the same model (Bruzal 2007) 
that was applied by Stark et al. (2009) to find the stellar 
masses of the real galaxies. This valuable exercise produced 
stellar mass estimates which were in excellent agreement 
(The fractional difference, e = Inferred mass / "True" mass, 
was found to have a standard deviation of just 0.3). 



the other extreme, as gas is exhausted and the growth of Ha 
suppresses further accretion, r z — > oo. 

In between times, while gas is plentiful but the age of 
the universe, t(z) >> r*, the fluctuations in star formation 
are ironed out, leading to much less scatter about the trend, 
which becomes comparable to t z ~ t(z). This is the era we 
are in at present. 

Fig. [5] illustrates this argument using the same model 
population considered in Figs, [I] and [4] but now seen at a 
range of redshifts. As time progresses (from right to left), 
we see the trend narrow and drop in accordance with the 
argument above. This evolution is also seen directly in Fig. 
[6] which shows t z against redshift. 

Highlighted in the upper row of Fig. [5] are the galaxies 
in the model population which would be visible in an ob- 
servational survey. At higher redshifts, this sample consists 
of brighter and brighter subsets of the population, which 
leads to a very strong bias towards select galaxies with high 
specific star formation rates. The mean rate, 1/ < t z >, is 
plotted in Fig. [6] as a function of redshift, and it can be seen 
that the value based on this visible sample diverges wildly 
from the true mean after about z ~ 2. 

A very separate issue, which is apparent in Fig. [6] is 
the evolution of the characteristic rates that would be in- 
ferred from this reduced sample. Using the process outlined 
in Sj3] the emission that is predicted by the model for each 
magnitude-selected sample is converted into stellar masses 
and star formation rates that would be inferred using, for 
example, the correlation Q. 

Also worth noting from this figure is the fact that 
none of the three lines follows the pattern that actually has 
been put together from collected observations (Dutton et 
al. 2010, and references therein), where the inferred char- 
acteristic rate is seen to flatten out at high redshifts. For 
the model to be consistent with these observations, it is of 
course the inferred line in Fig. [6] that should broadly agree 
with the published results. Clearly this is not the case, and 
this could be due to shortcomings of the model, or of the 
analysis applied to the observations. 

As far as the exercise in this paper is concerned, it is 
more important to emphasise, again, the discrepancy be- 
tween all three lines in Fig. [6j which calls into question how 
well the evolution in specific star formation rates has re 
been captured by surveys thus far. 



4 EVOLUTION OF TRENDS WITH REDSHIFT 

Section [2] established the argument in support of a common 
specific star formation rate for all central galaxies. Of course, 
this only applies at any one particular time; the constant of 
proportionality in equation |5]l , will be different for samples 
at different redshifts, hence the notation r z . 

To understand this evolution, we return to the picture 
used to support jH]), namely that r z is just the integral under 
the star formation rate history, in units of its current value. 
Two limiting cases are easy to identify immediately: at the 
highest redshifts, galaxies will be seen near birth and r z — ?> 
t*, the characteristic timescale for star formation itselfj At 



6 In the extreme limiting case, we would see the first star in the 
galaxy, and t z would be its age. 



5 SUMMARY 

Very different trends of stellar mass to star formation rate 
have been attributed to the same observational sample (Dut- 
ton et al. 2010; Khochfar & Silk 2010). In this paper, we 
have carried out a more rigorous investigation into the origin 
of these two key estimated physical quantities. 

A simple argument, based on hierarchical galaxy forma- 
tion theory, was presented to understand why a strong trend 
might exist between these two quantities, and that such a 
relationship can result from the self-similar nature of galaxy 
assembly, independently of star formation or feedback pro- 
cesses. 

Using a model galaxy population as a guide, we have 
shown that, due to the combined effects of selection bias 
and physical scatter in the relations between observable and 
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Figure 5. An illustration of how specific star formation rate could be independent of stellar mass but dependent on redshift, and how 
such evolution can be misinterpreted without a proper understanding of selection effects. Both rows of panels shows contour lines which 
represent densities in this plane of 2, 10 and 50 galaxies/dex 2 (10Mpc) 3 . The dots in the upper row are those galaxies that would be 
visible in a current survey (see j ]3,2| . The lower row shows these same galaxies, but with the specific star formation rate that would 
have been inferred from the uncorrected UV luminosity (see |3.3| . The characteristic rates for both the inferred sample and the true 
population are shown as points in Fig. [6] along with equivalent points for intermediate redshifts. 




2 4 6 8 

Redshift 

Figure 6. Characteristic growth rates, l/(r z ), for the true galaxy population and "observational" samples shown in Fig. [5] Open squares 
show the mean value for all the galaxies in the population. Filled squares show the value for just the galaxies that would be visible to 
observers under an exapmle UV limit of 01775 > 27 (derived from the dots in the upper row of Fig[5jl. Dots show the mean that would be 
inferred using the UV magnitude, without correction, to derive the star formation rate (derived from the dots in the lower row of Fig[5]|. 
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physical properties, this underlying trend can be easily mis- 
represented. There can also be a large discrepancy between 
inferred and true parameter values. 

These results highlight the importance of using realistic, 
physical galaxy formation models to guide the interpreta- 
tion of high-redshift surveys. By subjecting model-generated 
galaxy populations to the same analysis as the real data, ob- 
servation can be compared with competing theories on an 
even footing. In this way, new surveys can lead to more 
incisive quantitative conclusions about the true underlying 
galaxy population. 
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Table Al. Changes to the GALFORM model 



Parameter 


Bower et al. (2006) 


This version 


<*hot 


3.2 


2.5 


■"hot 


485 km s" 1 


300 km s- 1 


^reheat 


1.26 


0.3 


Q * 


-1.5 


-0.5 




350i dyn 


4 Gyr 


^cool 


0.58 


0.78 


Tmrg 


1.5 


1.0 


Vcut 


50 km s — 1 


30 km s" 1 



APPENDIX A: THE GALAXY FORMATION 
MODEL 

The mock galaxy population in the figures in this article 
was generated using a version of the GALFORM semi-analytic 
model which is currently under development. The goal of 
this new version is to combine the most realistic aspects of 
the two previously published versions (Baugh et al. 2005; 
Bower et al. 2006), whilst achieving a better match, than 
either of these, to current observational constraints. 

The development model is most closely related to the 
model published by Bower et al. (2006), but uses parame- 
ters for star formation and feedback that are more realistic; 
closer to those favoured by Baugh et al. (2005). A list of 
parameter changes appears in table [AT] The parameters t* 
and a* apply to star formation rate ip, as follows: 

These, and all other parameters, are as defined in Bower et 
al. (2006), and references therein. 

Other changes are the distribution of orbital parame- 
ters, which has been updated to follow Benson (2005), and 
the treatment of the cooling of hot halo gas, which now fol- 
lows Benson & Bower (2010). Full details of this version will 
appear in Lacey et al. (2011, in prep.). 



APPENDIX B: UV LUMINOSITY AS A STAR 
FORMATION TRACER 

To explore this relationship, we return to the model galaxy 
population that was shown in Fig. [2] Consider, first, their 
star formation rate vs. rest-frame UV magnitude, which is 
shown in the left main panel of Fig. |B1| The relation as- 
sumed in observational analysis is shown in the same panel 
as a dashed line, and it is immediately clear that such an 
approximation is good for the majority of the galaxy popu- 
lation. 

To clarify our understanding of this, the same star for- 
mation rates are shown in the right panel as a function of 
stellar mass. This shows the very strong correlation between 
current star formation rate, and mean star formation rate, 
which was discussed in Ej2] Now, for galaxies which lie on 
or above this trend, the massive star population is a large 
enough that they will indeed be the main contributors to the 
total UV luminosity, and the strong correlation |6| holds. 

For galaxies below the main trend, this approximation 
breaks down; less massive stars are so comparatively abun- 
dant that they are responsible for most of the total UV out- 
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Figure Bl. The scatter in the relationship between star formation rate and UV magnitude by incongruous star formation histories. The 
dots in the left central panel show the galaxies in the model population in the main text and the dashed line is the correlation of Madau, 
Pozzetti & Dickinson (1998). Four particular galaxies are circled and their star formation histories shown in small panels. For reference 
to physical properties, these same galaxies are also highlighted on a plot of star formation rate vs. stellar mass (right central panel). 



put, despite their poor individual contribution to this part 
of the spectrum. 

This is further illustrated by the small peripheral pan- 
els which show the star formation histories of four partic- 
ular galaxies. In the lower two panels, past star formation 
episodes were so productive that the stars produced then 
are outshining the recently formed stars, even at this high 
energy end of the spectrum. 

The main conclusion of this exercise is positive; hierar- 
chical formation theory predicts that only a small fraction of 
galaxies would differ from the assumed correlation. Further- 



more, such scatter as there is occurs mostly on the lower side; 
unusually high star formation rates may still be estimated 
correctly as it only serves to accentuate the underlying as- 
sumption Q. 

This one-sided nature of this error does means that 
characteristic star formation rates would be systematically 
overestimated but the practical consequences of this are neg- 
ligible, particularly when set aside the comparatively major 
problems presented by dust extinction, covered in §3.1| 



